UL2 is a unified framework for pre-training language models. Its pre-training objective, Mixture-of-Denoisers (MoD), combines several pre-training paradigms in a single objective, and the resulting models are reported to be effective across a wide range of datasets and settings.
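To make the Mixture-of-Denoisers idea concrete, the following is a minimal sketch, not the UL2 implementation. It assumes three denoisers tagged `[R]`, `[S]`, and `[X]` (the mode-token naming used in the UL2 paper), T5-style `<extra_id_N>` sentinels, and placeholder span lengths and corruption rates chosen only for illustration. The helper names `span_corrupt`, `prefix_lm`, and `make_example` are hypothetical.

```python
import random

# Illustrative settings only; these numbers are assumptions, not UL2's values.
DENOISERS = {
    "[R]": {"span": 3, "rate": 0.15},  # regular: short spans, low corruption
    "[X]": {"span": 8, "rate": 0.50},  # extreme: long spans, high corruption
    "[S]": None,                       # sequential: predict suffix from prefix
}


def span_corrupt(tokens, span, rate, rng):
    """Replace roughly `rate` of the tokens with sentinels, in spans of up to `span` tokens."""
    n = len(tokens)
    n_spans = max(1, round(n * rate / span))
    chunk = n // n_spans  # one masked span is placed inside each chunk
    inputs, targets, pos = [], [], 0
    for i in range(n_spans):
        length = max(1, min(span, chunk - 1))
        start = i * chunk + rng.randint(0, chunk - length)
        sentinel = f"<extra_id_{i}>"
        inputs += tokens[pos:start] + [sentinel]
        targets += [sentinel] + tokens[start:start + length]
        pos = start + length
    inputs += tokens[pos:]
    return inputs, targets


def prefix_lm(tokens, rng):
    """Split at a random point; the model sees the prefix and predicts the rest."""
    split = rng.randint(1, len(tokens) - 1)
    return tokens[:split] + ["<extra_id_0>"], ["<extra_id_0>"] + tokens[split:]


def make_example(tokens, rng):
    """Pick one denoiser at random and prepend its mode token to the input."""
    if len(tokens) < 2:
        raise ValueError("need at least two tokens")
    mode = rng.choice(sorted(DENOISERS))
    cfg = DENOISERS[mode]
    if cfg is None:
        inputs, targets = prefix_lm(tokens, rng)
    else:
        inputs, targets = span_corrupt(tokens, cfg["span"], cfg["rate"], rng)
    return [mode] + inputs, targets


if __name__ == "__main__":
    rng = random.Random(0)
    words = "the quick brown fox jumps over the lazy dog near the old river bank".split()
    inputs, targets = make_example(words, rng)
    print(" ".join(inputs))
    print(" ".join(targets))
```

Each call yields one (input, target) pair from a randomly chosen denoiser, so a stream of such pairs mixes the paradigms within a single training objective. The mode token at the start of the input tells the model which kind of corruption was applied.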
Large Language Model
Transformers, English